Joint, Adaptive Control of Equalization, Synchronization, and Gain in a Digital 

Communications Receiver 

Field of Invention 

The present invention relates to the control of functions in a communications receiver 
that govem physical layer operations of equalization, synchronization (carrier frequency 
and phase, and timing recovery), and automatic gain control. These functions are 
controlled from a central signal processing and logic block to achieve robust, autonomous 
receiver operation. 

Background of Invention 

In many digital communication systems, a source generates digital information, such as 
data, audio, or video, that is to be transmitted to multiple receivers. The digital 
information bits are divided into blocks that define a discrete alphabet of symbols. These 
symbols are used to modulate a radio frequency (RF) carrier's frequency, amplitude 
and/or phase. For example, a quadrature oscillator can be used to modulate the symbols 
onto the amplitude and phase of the RF carrier, and the signaling is referred to as 
Quadrature Amplitude Modulation (QAM). The time between adjacent symbols is 
referred to as the baud (or symbol) period, and inverse as baud (or symbol) rate. 

The data-bearing RF carrier is amplified and transmitted through a propagation medium, 
for example, a cable, phone line, wireless terrestrial, satellite, underwater link, etc. As 
consumer appetite for information on demand continues to increase, so do requirements 
on data rates of present and emerging systems. Higher data rates increase the 
susceptibility of the data-bearing RF signal to environment-induced distortions such as 
fades, multipath (causing inter-symbol interference (ISI)), Doppler conditions, and 
additive noise processes. Communications receivers typically implement multiple stages 
of signal processing to individually address impairments witnessed by the RF signal en 
route to the receiver, and also introduced by the receiver and transmitter themselves, for 
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example, due to implementation with finite-length filters and finite-precision arithmetic. 
Typical fimctions that govern physical layer operation include equalization, 
synchronization to carrier phase and firequency, synchronization to baud sampling phase 
and frequency, and automatic gain control. 

Gain control processing adjusts the received signal to within the proper dynamic range 
expected by the receiver. Usually, gain control is itself done in stages, for example, at the 
RF level by the tuner, after RF to IF translation, and digitally within fi-ont-end and back- 
end signal processing. Most commercial UHFAW tuners include automatic gain 
control (AGC) circuits that tradeoff performance between adjacent and co-channel 
interference using average or peak power levels. AGC at the IF level usually adjusts the 
signal so that the full dynamic range of analog-to-digital conversion is utilized, without 
saturation. Sometimes, AGC utilizes pilot or reference tones or signals that are known, 
stored, or derived in the receiver to adjust IF AGC circuitry. In the digital domain, AGC 
circuitry may also use pilot or reference aids, or may operate blindly without the use of 
such overheard. For example, USSN 60/341,931, entitled "Self-initializing decision 
feedback equalizer with automatic gain control," by T.J. Endres et al., filed Dec. 17, 
2001, which is incorporated herein by this reference, describes an automatic gain control 
(AGC) circuit that is nested with a decision feedback equalizer (DFE) structure, which 
operates on novel error signals that are blind (do not use training, reference, or pilot aids). 
Similarly, USSN 10/246084, entitled, "Adaptive expanded information capacity for 
communications systems," by C. Long et al., filed September 8, 2002, which is 
incorporated herein by reference, describes a similar feedback AGC circuit that is nested 
with an equalizer and uses decision-directed (DD) error adjustment. 

Timing recovery refers to the process of synchronization to correct baud sampling 
frequency and phase. Sometimes, the oscillator clock that adjusts the analog-to-digital 
converter (ADC) at the input to the receiver is adjusted in fi-equency and phase. 
Alternatively, the sampled data in the receiver can be interpolated to achieve proper baud 
sampling. This second approach can reduce complexity by eliminating costly oscillator 
circuitry needed to control the ADC in the first approach. Timing recovery methods 
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sometimes try to detect zero-crossings of the received signal in the time domain. As 
such, some methods apply filtering techniques to upper and lower band edges of the data 
spectrum. For example, see USPN 5,872,815 by C. StroUe et al, entitled "Apparatus for 
generating timing signals for a digital receiver" which is incorporated by this reference. 
Altematively, another approach uses a decision-directed (DD) technique, usually 
requiring feedback from a decision device, or slicer, or quantizer. The DD methods 
described by Mueller and Muller in "Timing recovery in digital synchronous data 
receivers," IEEE Transactions on Communications, vol. COM-24, no. 5, May 1976, 
which is incorporated by this reference, are widely applied. Other blind algorithms, 
usually applied solely to equalization, have lately been successfully applied to timing 
recovery, too. For example, Guglielmi et al. in "Joint Clock Recovery and Baseband 
Combining for the Diversity Radio Channel," IEEE Transactions on Communications, 
vol. 44. p. 114-117, Jan. 1996, which is incorporated by this reference, jointly applies a 
Constant Modulus Algorithm (CMA), originally proposed by D.N. Godard in "Self- 
recovering equaHzation in two-dimensional data communication systems," IEEE 
Transactions on Communications, vol. 28, no. 1 1, pp. 1867-1875, Oct. 1980, which is 
also incorporated by this reference, for equalization, to joint optimization of equalization 
and timing recovery. This joint optimization is further analyzed by Chung et al. in 
"Timing Recovery Based on Dispersion Minimization," Proceedings of the 2001 
Conference on Information Sciences and Systems, March 2001 which is incorporated by 
reference. 

Equalization in a digital communications receiver is analogous to an equalizer on a stereo 
system: the equalizer filters the distorted, received waveform (from a tape head or RF 
antenna) and tries to mitigate distortions and restore the signal properties of the original 
source. In a digital communications receiver, filter mismatches in transmitter and 
receiver, finite-precision implementation, and propagation channel effects induce inter- 
symbol interference (ISI), in which the receiver processes a signal that contains multiple, 
delayed and weighted copies of the transmitted signal. For example, reflections of the RF 
signal from a large building will induce ISI (or multipath), and reflections from an 
airplane will induce multipath distortion with a time-varying Doppler component. Since 
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the exact channel characteristics are not known apriori at the receiver, the equahzer is 
usually implemented with adaptive methods. Adjustment of filter coefficients can be 
done with trained equalization methods, relying on the embedding of a pre-determined 
training sequence in the transmitted data. Usually, equalizer coefficient convergence 
relies on multiple transmissions of the training sequence, and the channel characteristics 
are also time varying, requiring periodic re-training. The Least Mean Squares (LMS) 
algorithm, which was proposed by Widrow, McCool, and Ball, in The Proceedings of the 
IEEE, vol. 63, no. 4, pp. 719-720, April 1975, which is incorporated by reference, 
minimizes a Mean Squared Error (MSE) cost function and is a stochastic gradient descent 
update rule utilizing the training sequence. Alternatively, blind methods do not rely on a 
reference signal, or derive a reference signal from the data itself, and are therefore 
desirable, since in the absence of a training signal, revenue-generating user data can 
instead be transmitted. A common blind equalization method replaces the reference 
signal in the LMS algorithm with the receiver's best guess at the data, and is referred to 
as Decision Directed LMS (DD-LMS), as proposed in a paper entitled "Techniques for 
adaptive equalization of digital communication systems," by R.W. Lucky, in the Bell 
Systems Technical Journal, vol. 45, no. 2, pp. 255-286, Feb. 1966 which is incorporated 
by reference. Unfortunately, DD-LMS needs a reasonably low percentage of incorrect 
decisions to prevent algorithm divergence, and is therefore impractical from a cold-start 
initialization. Godard's CMA is an attractive alternative that provides robust signal 
acquisition in harsh environments. These algorithms are sometimes applied to linear, 
finite-impulse response (FIR) filters, or decision feedback equalizer (DFE) structures, in 
which an FIR filter is embedded in a feedback loop (so that the overall impulse response 
is infinite) that processes hard decision samples from a decision device, sheer, or 
quantizer. USSN 60/341,931 by T.J. Endres et al. entitled "Self-initializing decision 
feedback equalizer with automatic gain control," which is incorporated by reference, uses 
novel, adaptive combining techniques of soft and hard decision samples in a DFE 
structure with CMA, LMS, and DD-LMS for robust, blind acquisition and re-acquisition. 

Synchronization to carrier frequency and phase is usually done in stages, including, for 
example, RF to IF translation, IF to passband (or near baseband), and translation to 



ATLUBOI 1481449.2 



4 



precise baseband. Since the receiver in a coherent communications system must 
ultimately make hard decisions for symbol estimates, the data-bearing RF signal must 
ultimately be brought down to precise baseband. Synchronous or quasi-synchronous 
detectors can be used for intermediate translations. Translation to precise baseband can 
be done in a DD fashion, by nesting a phase-locked loop with the sheer and equalizer. 
For example, the instantaneous phase offset between sheer input and output is detected, 
filtered, and used to drive an oscillator in "Digital Communication" by E.A. Lee and D. 
G. Messerschmitt, Kluwer Academic Publishers, 1994, which is incorporated by 
reference. The PLL can be closed at various points within the equalizer. For example, 
StroUe et al describe alternative DFE architectures for reception of digital television 
signals that include a DD carrier loop, deriving its input from slicer input and output, but 
derotating the received signal to precise baseband at various points within the DFE 
structure. (See: Feasibility of reliable S-VSB reception;' C.H. Strolle, S.N. Hulyalkar, 
T.J. Endres, Proceedings of the NAB Broadcast Engineering Conference, Las Vegas, NV, 
pp. 483-488, April 8-13, 2000, which is incorporated by reference.) 

The present invention relates to the joint, adaptive, control of physical layer functions 
that govem equalization, baseband synchronization to precise carrier phase and 
frequency, baud sampling phase and frequency, and automatic gain control in the digital 
domain. 

Summary of Invention 

hi accordance with certain aspects and embodiments of the present invention, physical 
layer functions goveming equalization, translation to precise baseband, synchronization 
of sampling phase and frequency, and automatic gain control in the digital domain are 
jointly controlled and adapted with a central signal processing and logic block. 

Brief Description of Drawings 
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Other aspects, features, and advantages of various aspects and embodiments of the 
present invention will become more fully apparent from the following detailed 
description, the appended claims, and the accompanying drawings in which: 

Figure 1 shows a typical prior-art communication system that may be employed 
for transmission of digital signals; 

Figure 2 shows a block diagram of timing recovery, equalization, carrier recovery, 
and automatic gain control, jointly controlled by a central signal processing block in 
accordance with certain aspects and embodiments of the present invention; 

Figure 3 shows a block diagram of the preferred embodiment of Feedback AGC, 
showing adaptation of gain value at each symbol instance, and error term selection using 
control signals in accordance with certain aspects and embodiments of the present 
invention; 

Figure 4 shows a block diagram of alternative embodiment of Feedback AGC, 
showing error term generation, and bandpass filtering using cascaded low-pass and high- 
pass filter sections, in accordance with certain aspects and embodiments of the present 
invention; 

Figure 5 shows a block diagram of Carrier Loop nested with Equalizer and Sheer, 
used to translate the received signal to precise baseband; 

Figure 6 shows a block diagram of Decision Feedback Equalizer (DFE) in 
accordance with certain aspects and embodiments of the present invention; 

Figure 7 shows a block diagram of Timing Recovery Loop and its control by 
Signal Processing, Logic, and Control Block in accordance with certain aspects and 
embodiments of the present invention; 
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Figure 8 shows a Phase Detector for Timing Recovery Loop and its control by the 
Signal Processing, Logic, and Control Block; 

Figure 9 shows the: Signal Processing, Logic, and Control Block; 

Figure 10 shows a conceptual illustration of combining weight calculation; 

Figure 1 1 shows a circuit used to calculate combining weight A(n) in accordance 
with certain aspects and embodiments of the present invention; 

Figure 12 shows a conceptual drawing illustrating two possible templates used to 
J discern a control signal in accordance with certain aspects and embodiments of the 

present invention; and 

Figure 13 shows a flow diagram illustrating the update of array index £ in 
accordance with certain aspects and embodiments of the present invention. 

Detailed Description 

Figure 1 depicts a typical prior art digital communication system. Transmitter station 100 
is coupled to receiver 150 by propagation medium 147. The propagation medium could 
be a cable, telephone twisted-pair wire, satellite link, terrestrial link, or fiber optic 
connection, for example. Transmitter station 100 includes an information source 110, 
that contains the content such as data, audio, or video, which is to be communicated to 
the receiver 150. The information source 110 is coupled to encoder 120, which formats 
the information in a manner suitable for digital communication, typically in accordance 
with a given standard or protocol. The encoder 120 is coupled to modulator 140, which 
is also coupled to a quadrature oscillator 130. The modulator 140 uses the signal fi-om 
the quadrature oscillator 130 to modulate the encoded information provided by encoder 
120 onto a suitable Radio Frequency (RF) carrier frequency in amplitude and phase. The 
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modulated signal from modulator 140 is coupled to transmit antenna 145 for transmission 
into propagation medium 147. 

The receiver 150 receives the RF signal from propagation medium 147 via receiver 
antenna 149. Receiver antenna 149 is coupled to tuner 160. Tuner 160 is set to receive 
the RF signal in the desired frequency range, while rejecting signals in nearby or adjacent 
frequency ranges. Tuner 160 may provide automatic gain control at the RF frequency 
and also downconvert the received signal to an intermediate frequency (IF) before 
passing the signal to the analog EF Processing block, 162. Analog IF processing prepares 
the signal for analog-to-digital conversion, applying filtering and gain control, before 
passing the signal to the Front End Processing block 165. Front End Processing block 
165 samples the signal with an analog-to-digital converter and may contain further 
automatic gain control and filtering, digital downconversion in frequency, and quadrature 
demodulation to split the signal into in-phase (I) and quadrature-phase (Q) samples. 
Front End Processing block 165 is coupled to Timing Recovery module 170 that 
determines a correct sampling phase. Timing Recovery module 170 may adjust the 
sampling phase by interpolating the data samples, or adjusting the phase and sampling 
frequency of the analog-to-digital converter in Front End Processing block 165. Timing 
Recovery module 170 is coupled to Equalizer 175, which is used to mitigate the 
distortions, such as inter-symbol interference and noise, that are introduced by the 
propagation medium 147, transmitter 100, Tuner 160, IF Processing block 162, receiver 
Front End Processing block 165, and receiver Timing Recovery module 170. Equalizer 
175 is coupled to Carrier Recovery module 180, which detects residual offset in 
frequency and phase. The detected carrier offset in Carrier Recovery module 180 may be 
supplied back to the Equalizer 175 for translation of equalized samples to precise 
baseband, or used to adjust the downconversion process in Front End Processing block 
165, or both. Also coupled to Equahzer 175 is Feedback AGC module 177, which works 
in conjunction with Equalizer 175 to detect and correct rapid amplitude variations in the 
received signal. The output of Equalizer 175 is coupled to Error Correction module 185, 
which detects and corrects bit errors in the recovered bit stream. The Error Correction 
module 185 is coupled to Decoder 190, which decodes the bit stream in accordance with 
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the standard or protocol used in the Encoder 120 of Transmitter 100. The decoded bits 
from Decoder 190 represent the recovered information source, consisting of data, audio, 
or video, and are supplied to a user interface 195. Certain aspects and embodiments of 
the present invention are manifested in the joint, adaptive, blind control of Equalizer 175, 
Carrier Recovery module 180, Timing Recovery module 170, and Feedback AGC 
module 177. 

Figure 2 shows a block diagram of aspects of one embodiment of the present invention, 
including a Timing Recovery module 210, Feedback AGC 240, Carrier Recovery Loop 
250, two Adaptive Equalizer filters 220 and 230, and a Signal Processing, Logic and 
Control Block 260. The Slicer 270 is a nearest-element decision device that outputs the 
point in the QAM source constellation that has closest Euclidean distance to the Slicer's 
input sample. Alternatively, a partial trellis decoder can be used that exploits channel 
coding and does not necessarily make nearest-element decisions. Next, we will describe 
the different operating modes for each of the Timing Recovery, Carrier Recovery, 
Feedback AGC, and Equalization functions that will be controlled by the Signal 
Processing, Logic and Control Block 260. 



Feedback AGC 

A primary goal of the Feedback AGC is to provide quick, robust gain compensation of 
the signal that is input into the Slicer. This gain compensation may be required as the 
result of dynamic multipath conditions or flat fades. Though the equalizer itself provides 
gain compensation of the signal by adjusting the scale of the adaptive filters* coefficients, 
such compensation can be slow, since a plurality of equalizer coefficients (sometimes 
hundreds) must move to scale the data appropriately. Therefore, the Feedback AGC 
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provides a single, strictly positive gain value, and is designed to provide gain 
compensation more quickly than the equalizer is able to do. Two different Feedback 
AGC architectures are disclosed. 

Feedback AGC Preferred Embodiment 

The preferred embodiment of the Feedback AGC in the present invention utilizes 
techniques described in USPTO apphcation no. 60,341,931, entitled "Self-initializing 
decision feedback equalizer with automatic gain control," by TJ. Endres et al., filed Dec. 
17, 2001. These techniques rely on minimization of a specified cost fimction over the 
choice of gain value, and are implemented in an adaptive way with a stochastic gradient 
descent update rule. 

The automatic gain control signal a{n) is a real, strictly positive scalar, that is calculated 
at each baud instance by 

a{n) = p^a(n - 1) - //^ 

da{n-\) 

where is a leakage factor used in practice to mitigate divergence due to finite- 
precision effects or quantization noise, and is chosen less than or equal to unity, but close 
to unity, J is the cost function to be minimized by choice of a(n) , and is a real- 
valued, positive stepsize, chosen less than unity, and governs algorithm convergence rate, 
tracking capabilities, and stochastic jitter. 

Different cost functions can be used, and these different cost functions will define 
different modes of operation that will be controlled by the Signal Processing, Logic and 
Control Block 260 in Figure 2. 

Operational Mode 1: MSE-Like Cost Function 
The first operational mode of the Feedback AGC uses a Mean Squared Error (MSE)-like 
cost function, expressed as 
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J = E{{\ a{n - 1) . w(/i - 1) . e-^<"^ |' -\w{n- 1) 



where ^ is a positive integer and is set to one for the preferred embodiment. This cost 
function penalizes the squared difference in magnitudes between the slicer input and 
output, and is analogous to a mean squared error (MSE) cost function. The partial 
derivative calculation, denoted as ^in-\)^dJ/da(n-l) , assuming correct decisions 
and neglecting the expectation, for ^ = 1 , results in 

1) = - 1). I >v(« 1) I - 1 iHai - 1) |]- 1 >v(/i 

where we have absorbed a factor of two into the stepsize ju^ , and used the fact that 
sign{a{n - 1)) is always one since the automatic gain control signal is strictly positive by 
definition. 

Since the automatic gain control signal, a{n) , is applied to muhiplier 225 in Figure 2 at 
the current sample instance w, but also requires the use of the output of the multiplier 225 
in its calculation, a delay of one sample has been inserted in the calculation of ^(n - 1) 
and the cost function to keep the system causal. 

An alternative embodiment of the present invention uses arbitrary positive integer q in 
the MSE-like cost function. See USPTO application no. 60,341,931 for a description of 
the update term for this case. 

Operational Mode 2: CM-Like Cost Function 
The second operational mode of the Feedback AGC uses a Constant Modulus (CM)-like 
cost function, expressed as 
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J = E{{\ a{n - 1) • - 1) . e''^'^ \' -yf) 



This cost function has the advantage that it does not rely on hard decisions from the 
output of the Slicer. Letting ^ = 2, taking the partial derivative and neglecting the 
expectation (as in the previous case for the MSE-like cost function) results in an error 
term for this cost function expressed as 

Un-\) = a{n-\y\ w{n - 1) f <| a{n - 1) • w{n - 1) f -y) 

The constant y is calculated as y = E{\sf}l E{\sf} , i.e., analogously to the Godard 
radius used in adaptation of the equalizer coefficients. 

An alternative CM-like cost function uses ^ = 1. The error term from the partial 
derivative is found as 

^{n-\) =1 w{n - 1) I .(I a(n ^ 1) • w{n - 1) | -y) 
In this case with ^ = 1 , the Godard radius is calculated bls y = E{\s\^}/E{\s\} , 

Operational Mode 3: Combining of MSE and CM Cost Functions 
The third operational mode of the Feedback AGC uses a linear combination of MSE-like 
and CM-like cost functions. In this case, the error term derived from a MSE-like cost 
function is weighted by 1 - A{n) , while the error term derived from a CM-like cost 
function is weighted hyA(n) , and the results are summed to produce an error term used 

to update the Feedback AGC gain in the stochastic gradient descent rule. This combining 
is expressed as 

iin-l) = (l-A{n))i^,,{n-l)^Ain)i^j,in-\) 
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The combining weight is time-varying and can be chosen among a variety of 
methods. For example, the combining weight can be set based on the signal-to-noise 
ratio, error rate, cluster variance, or number of symbols processed. In the preferred 
embodiment of the present invention, the combining weight is calculated in an adaptive 
method at each baud instance by the techniques discussed in USPTO application no. 
60,341,931, by T.J. Endres et al., entitled "Self-initializing decision feedback equalizer 
with automatic gain control". Altematively, the combining weight can be determined 
from a look-up-table, which is addressed by the array index that is described in the future 
sections of the present document. 

Operational Mode 4: No Update 
The fourth operational mode of the Feedback AGC halts adaptation of the Feedback 

AGC gain value. Li this case, the error term is set to zero, (« - 1) = 0 , and leakage is 

set to one, = 1 . Hence, the new gain value is equal to the old gain value. 

Enhancements: Averaging and Penalty Term 
The instantaneous error term ^(/i-l), calculated in accordance with one of the four 
operational modes previously described, can be averaged before being applied to the 
stochastic gradient descent update rule to calculate the automatic gain control signal 
a(«), to induce memory in and reduce the variance of the error signal. For example, the 
error term used in the stochastic gradient update can be calculated as 

^(« - 1) = (1 - p„^, ) • - 2) + p^^^ . an - 1) 

with automatic gain control signal updated by 

a{n) = p^a{n - 1) - nj{n - 1) 
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where p^^^ is chosen greater than or equal to zero, but less than or equal to one, and is a 
leakage factor. Selecting p^^^ = 1 represents no leakage and induces no memory in the 
error term. In this case, the error term relies purely on the unity-delayed samples and 

To reduce undesired interaction between the EquaUzer and Feedback AGC, a penalty 
term can be added to the cost function. This extra term penalizes gain values that are 
different from a target value, T , For example, the cost function including the penalty 
term is expressed as 

where y(? is a small, non-negative weighting factor for the additional penalty term. The 
automatic gain control value is then updated according to 

a{n) = p,a{n - 1) - ^^^(n {a{n - 1) - T) . 

The product ju^ - p can be collapsed into a single value, to ease implementation. 

Figure 3 shows a block diagram of the preferred embodiment of the Feedback AGC in 
accordance with certain aspects and embodiments of the present invention. Signals 
and the input to the Slicer after derotation and gain scaling, and the Sheer output, 

respectively, are input to the Error Term Generator 310, as well as the current age gain 
value, a{n) . Error Term Generator 310 calculates an instantaneous error term, ^{n - 1) , 
responsive to the control bits from the Signal Processing, Logic, and Control Block (260 
in Figure 2). The Signal Processing, Logic, and Control Block judges the demodulator's 
performance and assigns an error term to be used by the Feedback AGC. (We will later 
see how the adaptation strategy is made completely flexible and programmable by LUT 
swap in the Signal Processing, Logic, and Control Block.) The four operational modes of 
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the Feedback AGC can be associated with the relative performance of the demodulator, 
according to, for example. 



Operational Mode (name/number) 


Stage of Demodulator Operation 


CM-like/2 


Initial acquisition or strong dynamic environment 


Combined CM-like and MSE-like/3 


Secondary acquisition stage or moderate dynamic 
environment 


MSE-like/1 


Tracking stage or mild dynamic environment 


No update/4 


Tracking stage or static environment 







The instantaneous error term ^(n-l) is applied to a leaky integrator, comprised of 
multiplier 315, multiplier 325, and delay element 330, to induce averaging. The stepsize 
/i^ is applied in multiplier 335 to the averaged error term (^{n - 1) . Adder 340 sums the 
update error term, penalty term, and previous AGC gain value to produce an updated 
AGC gain value, a{n). Leakage to avoid finite precision effects can be applied in 
multipHer 345 with leakage factor to the previous AGC gain value, a(n-l), 
supplied by delay element 360. The previous AGC gain value, -1), is also applied 
to summer 350 where a difference is formed with target threshold T . This result is 
multiplied by the weighting factor -jUa 'J^ multiplier 355. The updated AGC gain 
value fi-om adder 340, a{n) , is also fed back to the Error Term Generator 310 to form the 
next instantaneous error term. 

Feedback AGC Alternative Embodiment 

An alternative embodiment of the Feedback AGC uses a bandpass control loop to adjust 
the AGC gain value. The bandpass characteristic is designed so that the rise (or attack) 
time of the Feedback AGC is very fast (about a few symbol periods) compared to the 
decay time of the Feedback AGC (hundreds or thousands of symbol periods). The gain 
value of the Feedback AGC decays to a prescribed threshold, nominally unity, as time 
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goes to infinity. Hence, in a dynamic environment the overall gain of the received signal 
is quickly adjusted by the Feedback AGC and seamlessly transferred to the Equalizer. 

Figure 4 shows a block diagram of this altemative embodiment of the Feedback AGC in 
accordance with certain aspects and embodiments of the present invention. The loop is 
configured as a second order control loop with zero response at DC. The loop is sampled 
at the symbol rate. 

The control loop derives the gain value from an error penalizing the difference of the 
ratio of the magnitudes of soft to hard decisions from a target value, for example, unity. 
The complex magnitude (square root of sum of real and imaginary components squared) 
of the equalizer output sample, is calculated in the CABS block 405. Look-up- 

table 410 stores inverse magnitudes of constellation points, and is addressed using the 
hard decision samples, w(n), or indices associated with these hard decision samples, 
coming from the Slicer. Multiplier 415 forms the ratio of magnitudes of soft to hard 
decision samples, and this is termed the magnitude error and denoted as 

Adder 420 takes the difference of the magnitude error from a target threshold, T, 
nominally unity, and forms the loop error, denoted as 

E.^W=(r-^,W) 

The loop error, E,^„(«), is first filtered by a low pass section that has transfer function 

H{z) = 
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Adder 425 sums the loop error E^{n) with a delayed and scaled version of the low pass 
output. Delay element 435 delays the low pass signal . Control signal reset is used to 
clear the contents of delay element 435 and output a zero- value from mux 440 instead of 
the output of delay 435. The output of mux 440 is multiplied with low pass scale in 
multipHer 430. Low pass scale is used to set the breakpoint of the low pass filter 
section. The output of multiplier 430 is summed with the input to the low pass section in 
adder 425. The low pass signal is scaled by low pass gain, fi^ , in multiplier 445, 

producing the output of the low pass section, /^^ , also used as the input to the high pass 
section. 

The high pass section has transfer function 

W(z)=^""-^''. 

(l-a^z-) 

Signal f^jj is summed with a delayed and scaled version of signal /^j^^ in adder 450, 
producing signal Signal /^^^^ is delayed in delay element 460. Control signal reset 
is used to clear the contents of delay element 460 and output a zero-value from mux 465 
instead of the output of delay element 460. The output of mux 465 is multiplied with 
high pass scale a„ in multiplier 455. High pass scale a„ is used to set the breakpoint of 
the high pass filter section. The output of mux 465 is also subtracted from signal in 
adder 470, producing high pass signal . High pass signal is scaled by high pass 
gain, J3„ , in multiplier 475, producing the high pass output signal, . 

The high pass output is added back to the target threshold, T, in adder 480, 

producing the calculated Feedback AGC gain value. Upon reset, however, mux 485 
selects a unity-valued gain signal instead of the calculated AGC gain value. Using 
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control signal FbagcFixedScaleEn, mux 490 sets the AGC gain value to programmable 
gain value FbagcFixedScale instead of the calculated gain value. 

For a baud rate of approximately 613 KHz, nominal gain and scale values can be set to: 











0.999 


0.999 


0.03 


1.0 



The Signal Processing, Logic and Control Block 260 in Figure 2 controls this alternative 
embodiment of the Feedback AGC in accordance with certain aspects and embodiments 
of the present invention by selecting between one of the following operational modes at 
each baud instance. 

Operational Mode 1: Decision Directed 

When the Feedback AGC is operated in a decision directed (DD) mode, the magnitude 
error is found as 

\w\n\ 

and the filtering is as described above for Figure 4. 

Operational Mode 2: Freeze 

When the Feedback AGC is operated in a Freeze mode, the system clocking of the circuit 
in Figure 4 is disabled, so that all memory contents, including the output AGC gain value, 
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remain unchanged. Effectively, this circuit is bypassed, and there is no change to the 
internal state. 

Operational Mode 3: Fixed Scale 

When the Feedback AGC is in Fixed Scale mode, all internal calculations are performed. 
However, mux 490 in Figure 4 replaces the calculated gain value with the value of the 
programmable register FbagcFixedScale. 

Operational Mode 4: Reset 

When the Feedback AGC is in Reset mode, the accumulators 435 and 460 are set to zero, 
mux's 440 and 465 output a zero- valued sample, and mux 485 outputs a unity- valued 
sample for the AGC gain value. 

Carrier Loop 

The goal of the final stage of carrier recovery is to translate the received signal to precise 
baseband in the synchronous demodulator. A precise baseband signal is needed to form 
hard decision samples, or symbol estimates, in the Sheer. A Carrier Recovery loop is 
used to track out any residual carrier frequency or phase offset, and phase noise 
introduced in the demodulation process. In Figure 2, the conjugated output e"^ '^^"^ of the 
Carrier Loop 250 is used to de-rotate the passband Equalizer output to precise baseband 
in multiplier 225, while the output e^^'^^"^ of the Carrier Loop 250 is used to re-rotate the 
baseband Sheer output to passband in multiplier 235. 

The Carrier Loop 250 is described in more detail in Figure 5. This carrier loop uses a 
decision-directed (DD) phase error in a second order, (digital) phase locked loop with a 
numerically-controlled oscillator (NCO) to produce the sin and cosine terms needed to 
perform the de-rotation and re-rotation of Equalizer and Sheer outputs, respectively. 
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Decision-directed phase and frequency recovery is discussed in the textbook by Lee and 
Messerchmitt, entitled Digital Communication, Appendix 17-B, Kluwer Academic 
Publishers, Norwell, MA, Second Edition, 1994, which is incorporated by this reference. 

In Figure 5, the baseband equalizer output, w= u^^ +7 • is supplied as input to Slicer 

270, and also to Coarse Tune Hard Decision Block 505. This block is not discussed in 
Lee and Messerschmitt, and is used in certain aspects and embodiments of the present 
invention. Coarse Tune Hard Decision Block 505 provides an alternative hard decision 
sample than the one supplied by the Slicer, a nearest-element decision device. The 
alternative hard decision samples can be used in more of an acquisition state, while the 
hard decision samples from the Slicer are generally more reserved for tracking or steady- 
state demodulator operation. Coarse Tune Hard Decision Block 505 can be configured in 
various operating modes to produce alternative hard decision samples. 

Coarse Tune Hard Decision Operating Mode 7, Linear Combination of Soft and Hard: 
In this preferred embodiment of the Coarse Tune Hard Decision Block 505, the 
alternative hard decision sample, , is formed by taking linear combinations of soft 
decision samples from the equalizer and hard decision samples from the Slicer, 

=(l-/l)-W + /l- w 

where the real-valued combing weight X is greater than or equal to zero but less than or 
equal to unity, and is selected adaptively. Methods to determine the combining weight 
are described in USSN. 60/341,931, entitled "Self-initializing decision feedback equalizer 
with automatic gain control," by T.J. Endres et al., filed Dec. 17, 2001, which is 
incorporated by reference, and will also be described in the sequel. 

Coarse Tune Hard Decision Operating Mode 2, Quadrant Mapping: 

An altemative embodiment of the present invention uses Quadrant Mapping of the soft 

decision sample. The basic goal of generating an altemative hard decision sample using 
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Quadrant Mapping is to estimate the quadrant in I/Q space in which the soft decision 
sample belongs, and scale the sample depending on the modulation format. This simple 
operation is expressed by selecting the altemative hard decision sample, , according to 

= A • {signiw,) + J . signiwg)) 

where A is a real- valued, positive scalar that can be selected according to the modulation 
format. 

Coarse Tune Hard Decision Operating Mode 3, Reduced Constellation: 
Certain altemative embodiments of the present invention quantize the soft decision 
sample not to the nearest element of the source alphabet (as the Slicer does), but to the 
nearest element of a subset of the source alphabet. This is referred to as a reduced 
constellation since only part of the source constellation is used to generate the altemative 
hard decision sample. 

Coarse Tune Hard Decision Operating Mode 4, Reduced Precision: 
Suppose that the baseband soft decision sample, w , is represented by M bits, while the 
Slicer's hard decision sample needs only iV^ bits to be exactly represented, with M>N. An 
altemative hard decision sample is found by rounding and saturating the end points of the 
soft decision sample, to a bit-width P, where M>P>N. For example, suppose the soft 
decision sample is represented by M=J6 bits, but only N=7 bits are needed to precisely 
represent the Slicer hard decision sample on the constellation grid. Then an altemative 
hard decision sample can be found by formatting the soft decision down to P=9 or P=10 
bits. 

Coarse Tune Hard Decision Operating Mode 5, Arbitrary Nonlinearity: 
The Coarse Tune Hard Decision Block 505 can also generate altemative hard decision 
samples according to an arbitrary nonlinear curve, without memory. For example, a 
hyperbolic tangent curve can be approximated. 
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These operating modes are selected using input signal "control 3" to Coarse Tune Hard 
Decision Block 505. Input signal "control 3" is supplied from the Signal Processing, 
Logic, and Control Block (260 in Figure 2) at each baud instance in accordance with 
certain aspects and embodiments of the present invention. 



The alternative hard decision sample that is output from The Coarse Tune Hard Decision 
Block 505 in Figure 5 is supplied to mux 510. Mux 510 selects which hard decision 
sample is supplied to the Phase Detector 515, from among the hard decision sample from 
the Sheer and the alternative hard decision sample from the Coarse Tune Hard Decision 
Block 505. Mux 510 is controlled by input signal "control 1" that is supplied from the 
Signal Processing, Logic, and Control Block (260 in Figure 2) at each baud instance in 
accordance with certain aspects and embodiments of the present invention. 



The Phase Detector 515 derives an instantaneous estimate of the phase error between soft 
and hard decisions, and is described in Lee and Messerchmitt, Digital Communication, 
Appendix 17-B, Kluwer Academic Publishers, Norwell, MA, Second Edition, 1994, 
which is incorporated by reference. Using a small angle approximation, the phase error 
estimate is calculated according to 



where w-w^ -\-j-WQ is the baseband soft decision sample, and w = + 7 • is the 

hard decision sample supplied from mux 510. In practice, the denominator is sometimes 
omitted. 

For the preferred embodiment of the Coarse Tune Hard Decision Block 505, using 
Operating Mode 1, the hard decision sample used in the phase error calculation is the 
weighted sum of soft decision samples from the Equalizer and hard decision samples 
from the Sheer, using adaptive combining weight X . When A = 0, signal quality is very 
good, and the instantaneous phase estimate is calculated with a hard decision sample that 
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is equal to the Slicer hard decision sample, as intended in Lee and Messerchmitt, Digital 
Communication, Appendix 17-B, Kluwer Academic Publishers, Norwell, MA, Second 
Edition, 1994. However, when X = 1 and signal quality is poor, the instantaneous phase 
estimate, ^, collapses to zero, which corresponds to no update to the phase and 
frequency produced by the PLL. In an acquisition state of the demodulator, when signal 
quality is poor, and hard decision samples are more unreliable, it is desirable to halt 
adjustment of frequency and phase that could slow or event prevent convergence of the 
loop. Hence, adaptively combining soft decisions from the Equalizer and hard decisions 
from the Slicer, the PLL is automatically transitioned among acquisition and tracking 
states, and halted (or slowed) when signal quality worsens and its inputs become 
unreliable. 

The output of Phase Detector 515 is supplied to mux 520. Mux 520 selects from among 
Phase Detector 515 output or a zero-value. Mux 520 is controlled by signal "control 2", 
that is supplied from the Signal Processing, Logic, and Control Block (260 in Figure 2) at 
each baud instance in accordance with certain aspects and embodiments of the present 
invention. 

The output of Mux 520 is supplied to Loop Filter 599. Multiplier 530 applies 
proportional control by multiplication with Tj , a programmable real-valued scalar, while 
multiplier 525 applies integral control by multiplication with , a programmable real- 
valued scalar. The output of multiplier 525 is then integrated in adder 535 and delay 
element 540. The output of adder 535 is the integral control, and is summed in adder 545 
with the proportional control from multiplier 530. Adder 545 supplies the Loop Filter 
599 output to the NCO 598. 

Numerically-controlled oscillator (NCO) 598 integrates the signal from adder 545 (Loop 
Filter 599 output) in adder 550 and delay element 555. The output of adder 550, 
representing instantaneous phase of the desired sinusoid, is supplied to a Sin/Cos 
Generator 560. Sin/Cos Generator 560 uses prior art techniques, such as LUT storage of 
sinusoid, in whole or part, or direct calculation of approximation by Taylor (or other) 
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series, for example, to provide sine and cosine terms. These sine and cosine terms are 
used to de-rotate the Equalizer soft decision sample by e'^^^"^ , and also to re-rotate the 
Slicer hard decision sample by e^^'^^"^ . 

Equalization 

One preferred embodiment of the Equalizer portion of the present invention is extracted 
from Figure 2 and drawn in Figure 6. This equalizer architecture is a Decision Feedback 
Equalizer (DFE) with both forward and feedback filters operating in the passband, that 
uses prior art techniques described in USSN 60/341,931, entitled "Self-initializing 
decision feedback equalizer with automatic gain control," by TJ. Endres et al, filed Dec. 
17, 2001 which is incorporated by reference. 

In Figure 6, the received signal, r(n) , is a passband sample that is supplied from Timing 
Recovery Block 210 in Figure 2. Forward filter 220 may operate at the baud rate or 
faster, in which case the equalizer is said to be fractionally-spaced, and exploits temporal 
diversity. Also, the forward filter 220 may receive multiple inputs, as from multiple 
antennas, to exploit spatial diversity. Temporal or spatial diversity use a multi-chaimel 
forward filter. For simplicity, however, a single forward filter 220 is shown, and 
extension to a multi-channel model is understood by one skilled in the art. 

Forward filter 220 is a finite impulse response (FIR) filter, computing its output 
according to the convolution sum 

x(n) = /o(ai) . r(n) -hf.in) • r(n - 1) +f,in) • r{n - 2) + ... + /,^., (n) - r{n - 1^ + 1) 

where r(n) is the sample sequence input to forward filter 220, x{n) is the output sample 
sequence of forward filter 220, f. are the forward filter coefficients (or parameters,) and 
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Lj^ is the number of forward fiher coefficients. Note that the forward filter coefficients 
are also shown with time index n to indicate that the forward filter 220 is adaptive. 

The feedback filter 230 is not multi-channel, and is a FIR filter that calculates its output 
according to the convolution sum 

y{n) = go(«) • +g, («) • v(rt - 1) («) • v(w - 2) + ... + g^^ («) • v{n - + 1) 

where v{n) is the sample sequence input to feedback filter 230, y{n) is the output 

sample sequence of feedback filter 230, g. are the feedback filter coefficients (or 

parameters,) and is the number of feedback filter coefficients. Note that the feedback 

filter coefficients are also shown with time index n to indicate that the feedback filter 
230 is adaptive. Though the feedback filter 230 is a FIR filter, it is embedded in a 
feedback loop, so that the equalizer has an overall impulse response that is infinite. 

Adaptation of the forward filter 220 coefficients and feedback filter 230 coefficients uses 
a stochastic gradient descent update rule: 

/(A^ + l) = y;.(n)-//^/(«).e(n) 
giin + 1) = gi(n) - //^ • (p\n) • e{n) 

where (•)* represents complex conjugation, and ju^ and //^ are small, positive stepsizes 
governing algorithm convergence rate, tracking capabilities and stochastic jitter. Using 
simplified updates, the data used in the adaptation equations are set to xi^) = ^(«) 
(p{n) = v(n) . The error term, e(n) , depends on the operating mode of the Equalizer and 
is controlled by the Signal Processing, Logic and Control Block 260 (in Figure 2). The 
various operating modes will be discussed later in this secfion. Setting = ^i^) and 
(p(n) = v(n) in these equations used to adapt forward filter 220 and feedback filter 230 
coefficients is referred to as "simplified updates," since the step known as regressor 
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filtering is omitted. True cost fiinction minimization requires an extra stage of filtering 
for the regressor data of the forward filter 220 and the feedback filter 230 in the 
adaptation process, using the current equalizer coefficients. Such regressor filtering is 
often omitted in practice to ease implementation. Regressor filtering is described in 
Chapter 5 of "Theory and design of adaptive filters" by J.R. Treichler, C.R. Johnson, Jr., 
and M.G. Larimore, Prentice Hall, 2001 which is incorporated by reference. One skilled 
in the art would recognize how to modify the regressor data used in the adaptation 
equations above to incorporate the extra stage of regressor filtering. 

Adder 215 combines the outputs of forward filter 220 and feedback filter 230, x{n) and 
y{n) , respectively, to form sample sequence w(n) . Sample sequence w{n) is referred to 
as passband soft decision samples. Multiplier 225 applies de-rotation fi-om the Carrier 
Loop 250 by e"^^*"^ and gain scaling from the Feedback AGC 240 by a{n) . The output 
of multiplier 225 is denoted w{n) and referred to as baseband soft decision samples. 
These basband soft decision samples are input to Sheer 270. Slicer 270 is a nearest- 
element decision device that outputs a hard decision, w(n) , corresponding to the source 
alphabet member with closest Euclidean distance to its input sample. The hard decisions, 
w(n) , fi"om slicer 270 are re-rotated to the passband by application of Carrier Loop 250 

signal e''^'^^"^ , and re-scaled by Feedback AGC 240 signal a'\n) in multiplier 235. 

The output of multipher 235 is weighted by l-A(/i) in multiplier 245, and the signal 
w{n) is weighted by A{n) in multiplier 265. Signals l-A{n) and are real- valued, 
adaptive combining weights provided by Signal Processing, Logic, and Control Block 
260 (in Figure 2) and are greater than or equal to zero, but less than or equal to unity. 
The weighted signals fi-om multipliers 245 and 265 are summed in adder 255 to form 
feedback sample v(«) , used as input data to feedback filter 230. 

The operational mode of the Equalizer is determined by error term, e{n) , whose selection 
is governed by the Signal Processing, Logic, and Control Block 260 (in Figure 2). These 
operational modes are next described. 
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Operational Mode 1: LMS 



The Least Mean Squares (LMS) algorithm minimizes a Mean Squared Error (MSE) cost 
function that penalizes the squared difference between source sequence s{n) (in Figure 
1) fi*om the transmitter, and Equalizer output w(w). In practice, the Receiver 150 in 

Figure 1 stores or derives a replica of a training sequence, and the trained LMS algorithm 
uses error term 



so that e{n) is set to e,^,(«) . The baseband error term is re-rotated back to passband by 

multiplying the baseband error term with the sine and cosine terms from Carrier Loop 
250. The training sequence is usually periodically resent, wasting valuable bandwidth. 

In the absence of a training sequence, the output of Slicer 270 is used instead. Replacing 
s{n) with in the LMS error term is called Decision Directed (DD) LMS and uses 
the error term 



so thate(«) is set to e/„j(«) . 

When this operational mode is selected, training sequence data is used in the LMS 
calculation; else, hard decision samples are used in the LMS calculation. It is understood 
that modifications to these LMS error terms, using MSE techniques, to suit the specific 
application, are obvious to one skilled in the art. 

Operational Mode 2: CMA 
The Constant Modulus Algorithm (CMA) minimizes the Constant Modulus (CM) cost 
function and calculates an error term without the use of a training sequence. Hence, it is 
classified as a blind algorithm. The CM criterion penalizes the dispersion of the 



eiM = {^{n)-s{n))e 
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magnitude squared of the equalizer output sample from a predetermined scalar. The 
preferred embodiment of the present invention calculates the CMA error term according 
to 

where ;^ is a predetermined, real-valued scalar referred to as the Godard radius, or 
dispersion constant, and is usually determined from the source alphabet 
asy = E{\s{n)\'^}/ E{\s{n)\^} . Notice that the passband soft decision sample w{n)is 
used in the CMA error term calculation, so that there is no muUiplication required with 
sine and cosine terms. This sample is prior to Feedback AGC gain scaling in muhiplier 
225 (Figure 6). The inventors have determined that the use of soft decision sample prior 
to gain scaling in the CMA error term calculation reduces undesirable interaction 
between Equalizer 220 and Feedback AGC 240, compared to using the soft decision 
sample after gain scaling. In this case, e(n) is set to e^^^(n) . 

An altemative embodiment of the present invention calculates the CMA error tern 
according to 

In this case, e{n) is set to e^^^{n). 

These CMA error terms described are of order 2, as described by Godard. It is 
understood that modifications to these CMA error terms, using constant modulus 
techniques, to suit the specific application, are obvious to one skilled in the art. 
Furthermore, related Bussgang techniques, as summarized in chapter 2 of "Blind 
Deconvolution," Prentice Hall, written by S. Bellini, edited by S. Haykin, 1994, which 
work is incorporated by this reference, are generalizations of constant modulus 
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techniques, and can be substituted for the described CMA error terms by one skilled in 
the art. 

Operational Mode 3: Combined LMS and CMA 
Using the adaptive combing weights 1 - Z{n) and A{n) provided by Signal Processing, 

Logic, and Control Block 260 (in Figure 2), a combined error term is calculated 
according to 

e{n) = Ain) • e,^ (n) + (1 - - e,^„ (n) . 

Operational Mode 4: Halt Adaptation 
In this operational mode, the equalizer coefficients remain unchanged, so that the error 
term is 

e(«) = 0. 

The error terms described above are for the Equalizer in Figure 6 that has adaptive filters 
which operate in the passband. It is understood that modifications to these error terms for 
equalizer architetctures with adaptive filters in the baseband, or for equalizer 
architectures with some adaptive filters operating in passband and others in baseband, are 
obvious to one skilled in the art. 

Timing Recovery 

The purpose of the Timing Recovery Loop (170 in Figure 1) is to adjust the sampling 
phase and frequency of the received data. When the sampling epochs are adjusted to 
correspond to top-dead-center of the pulse shape, no ISI is introduced from neighboring 
symbols, since the pulse shape (for example, a square root raised cosine filter) is usually 
designed to be zero-valued at integer multiples of the baud rate. 
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To adjust the sampling epoch, most systems either adjust the sampUng phase and 
frequency of the analog-to-digital converter (ADC), or interpolate the sampled data 
somewhere in the signal processing chain. Figure 7 captures both of these approaches in 
a block diagram of Timing Recovery Loop 170 interfaced with Signal Processing, Logic, 
and Control Block 260 in accordance with certain aspects and embodiments of the 
present invention. 

In Figure 7, the received analog data r^{t) is sampled in Sample Generator 710, 

producing sampled data sequence r(n) = r^(nT + r), where T is the sampling period and 

r is the sampling phase. Sample Generator 710 may include an ADC with sampling 
period and phase that are adjusted by a clock provided by Reference Generator 740. 
Alternatively, Sample Generator 710 may include an ADC that samples data with a free- 
running clock, followed by a digital interpolator that is controlled by Reference 
Generator 740, for example, using a Numerically Controlled Oscillator (NCO). 

Phase Detector 720 receives the sampled data r{n), and computes a phase error 
responsive to signals from the Signal Processing, Logic and Control Block 260 in 
accordance with certain aspects and embodiments of the present invention. Loop Filter 
730 uses proportional and integral control to filter high-frequency signal components and 
integrate the phase error for sampling frequency adjustment. The Reference Generator 
740 uses the loop error signal from Loop Filter 730 to provide control signals for 
sampling frequency and phase adjustment in Sample Generator 710. 

In accordance with certain aspects and embodiments of the present invention, the 
operating mode of the Timing Recovery Loop 170 is determined from the signals 
provided by the Signal Processing, Logic, and Control Block 260 by selecting the 
criterion by which the phase error signal is calculated in the Phase Detector 720. Figure 
8 shows a block diagram of the logic in Phase Detector 720 using the control signals 
provided by the Signal Processing, Logic, and Control Block 260 to calculate a phase 
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error signal {[/{n) . The details of this block diagram will be apparent as the operating 
modes of Timing Recovery Loop 170 are discussed. 



Operating Mode 1: Band Edge 
Band edge timing recovery methods are prior art techniques basically based on the 
maximization of Signal to Noise Ratio (SNR) of sampler output. Band edge timing 
recovery schemes such as in USPN 5,872,815 by C. StroUe et al, entitled "Apparatus for 
generating timing signals for a digital receiver," which is incorporated by this reference, 
are implemented in pass band by exploiting the band edges of the data spectrum in order 
to obtain the timing phase maximizing the sampler output energy. 

Phase Detector 720 computes the base band equivalent band edge timing phase error 
signal by differentiating the following base band output energy cost function 

7 = («) 

or or 

dr 

where e^^C/i) is an error term (in this case derived from r(n) ) and — is approximated in 

dr 

a variety or prior art methods, for example, using FIR filters in Lee and Messerchmitt, 
Digital Communication, Appendix 17-B, Kluwer Academic Publishers, Norwell, MA, 
Second Edition, 1994 which is incorporated by reference. 

With the cost function defined as 



JoBMinT)^^\riT)\' 

the band-edge phase error is 
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This operating mode is selected by the Signal Processing, Logic, and Control Block 260 
through the four control bits D^[^ . Bit Dj^ instructs Phase Detector 720 to calculate a 
phase error signal with its own internal data, rather than using data provided from 
Equalizer 175. The remaining control bits, Z)}'^ , instruct Phase Detector 720 to calculate 
the specific error term selected by Signal Processing, Logic and Control Block 260. 

Band edge timing recovery techniques work well for blind acquisition, provided there is 
sufficient signal power in the band edges. However, a single echo multipath ray can 
attenuate the signal power in the band edges, and prevent phase lock. 



An alternative blind approach to timing recovery minimizes a CM cost function, 
J CM {^jT) , of the sampled data r{n) with respect to timing phase r , as proposed by 

Guglielmi et al. in "Joint Clock Recovery and Baseband Combining for the Diversity 
Radio Channel," IEEE Transactions on Communications, vol. 44. pp 1 14-117, Jan. 1996, 
which is incorporated by reference, and further described by Chung et al. in "Timing 
Recovery Based on Dispersion Minimization," Proceedings of the 2001 Conference on 
Information Sciences and Systems, Mar. 2001, which is incorporated by this reference. 

The phase error is found by differentiating the cost function with respect to timing phase, 
r . Using calculus, this differentiation is usually decomposed into 



Operating Mode 2: Calculate CMA 



dr 
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where ej.^(n) is an error term (in this case derived from r{n) ) and — is approximated in 

dr 

a variety or prior art methods, for example, using FIR fiUers in Lee and Messerchmitt, 
Digital Communication, Appendix 17-B, Kluwer Academic Publishers, Norwell, MA, 
Second Edition, 1994 which is incorporated by reference. (This same decomposition is 
used for each of the phase error signals that we will describe and are derived by 
minimizing a cost function.) The CMA error term is calculated as 

eTLin) = r(n) {\r{n)\' -y) 

The CMA error term calculated by Phase Detector 720 is selected by Signal Processing, 
Logic, and Control Block 260 using control bits D^l^ . Bit D^^ instructs Phase Detector 
720 to supply a phase error signal that is calculated with an error term using its own 
intemal data, rather than using the error term provided by the equalizer. In Figure 8, bit 
Z)°^ controls mux 835 for this selection. Control bits, D]^^ address mux 830 and are set 
according to the mux control table to select CMA. The derivative calculation in 
Derivative Calculator 810 uses sampled data r{n) . The error term provided by mux 830 
is multiplied in multiplier 825 by the derivative provided by Derivative Calculator 810, 
creating the phase error signal. 

Operating Mode 3: Calculate LMS 
A Mean Squared Error (MSE) style cost function Jmse^^^^^) is minimized over the 
sampled data r{n) with respect to timing phase r by Chung in "Blind Parameter 
Estimation for Data Acquisition in Digital Communications Systems". Ph.D. 
Dissertation, Cornell University, Ithaca, NY, Aug. 2002 which is incorporated by 
reference. 
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The phase error is found by differentiating the cost function with respect to timing phase, 
T , and is of the form 

=1 r{n)-r{n)\ 

where r{n) is a locally generated or stored hard decision estimate of the current received 
sample, for example, from a training sequence, a local (to Timing Recovery Loop 170) 
slicer, a reduced constellation mapping, or other method. 

The LMS error term calculated by Phase Detector 720 is selected by Signal Processing, 
Logic, and Control Block 260 using control bits D^l^ . Bit Z)^^ instructs Phase Detector 
720 to supply a phase error signal that is calculated with an error term using its own 
internal data, rather than using the error term provided by the equalizer. In Figure 8, bit 
Dj^ controls mux 835 for this selection. Control bits, D]^^ address mux 830 and are set 
according to the mux control table to select LMS. 

Operating Mode 4: Calculate Combo Error Term 
Using the adaptive combining weight, 2.{n) , provided by the Signal Processing, Logic, 
and Control Block 260, a combination error term, comprised of CMA and LMS error 
terms calculated in Operating Modes 2 and 3, is calculated by Phase Detector 720 
according to 

= (1 - X{n)) • e^M^in) + X{n) • e^^W 

In Figure 8, the adaptive combining weight, X{n) , is supplied to Error Term Calculator 
820, and control bits supplied by Signal Processing, Logic, and Control Block 260 
are set to select the combination error term in mux 830. Control bit Z)^^ selects a phase 
error in mux 835 that is supplied by multiplier 825. 
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Operating Mode 5: No Update 
In this operating mode, the phase error is set to zero. Control bit selects a phase 
error in mux 835 that is supplied by multiplier 825, while control bits Z)}^^ select a zero- 
valued error term in mux 830. 

Operating Mode 6: Use Error Term from Equalizer 
In this operating mode, the error term calculated in Equalizer 175 is used in the phase 
error calculation. In Figure 8, Derivative Calculator calculates a derivative using the 
equalizer output instead of the received data r{n) . Control bit Dj^ selects a phase 
error in mux 835 that is supplied by multiplier 815, which multiplies the derivative from 
Derivative Calculator 810 with the equalizer error term e{n) . 

Joint, Adaptive Control of Equalization, Synchronization, andAGC 

A block diagram of the Signal Processing, Logic, and Control Block 260 in accordance 
with certain aspects and embodiments of the present invention is shown in Figure 9. 
Error Term and Combining Weight Calculator 905 has two functions: (i) to provide 
candidate error terms (like CMA, LMS, and their combination) that are calculated from 
equalizer samples, and (ii) to provide combining weight A{n) . 

To calculate candidate error terms, baseband and passband Equalizer 175 (Figure 1) 
output samples, w(n) and w(n) , respectively, Sheer 270 ( Figure 2) hard decision output, 
w(n), and sine and cosine terms from Carrier Loop 180 are supplied to Error Term and 
Combining Weight Calculator 905. The combining weight A(n) can be calculated by 
methods in accordance with USSN 60/341,931, entitled "Self-initializing decision 
feedback equalizer with automatic gain control," by T.J. Endres et al., filed Dec. 17, 
2001, which are repeated here. 
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The combining weight is chosen at each baud instance by comparing the distance of the 
baseband soft decision, to its nearest element in the source constellation, and 

normalizing by the size of the decision region. This idea is illustrated in Figure 10, using 
a 16-QAM alphabet. 

The left-hand-side of Figure 10 shows a 16-QAM constellation 1010, and the right-hand- 
side is an exploded view of a single decision region 1020 for the constellation point 1025. 
The width of the decision region is 2A , and the distance of the scaled soft decision 350 
to the constellation point 1025 is therefore |M<n)-w(w)|. Excluding outermost 

constellation points that have open decision regions, the ratio X(n) =| w(n) - w{n) \ / V2A 
does not exceed unity. For those outermost constellation points, if X{n) exceeds unity, it 
is set to unity. Hence, on an instantaneous basis, A(n) is bounded between zero and one, 
and provides an instantaneous measure of signal integrity: when the scaled soft decision 
1050 is far fi-om the hard decision (constellation point) 1025, X(n) is close to unity; 

when the scaled soft decision 1050 is close to the constellation point 1025, X(n) is close 
to zero. 

To add memory or induce averaging to the instantaneous combining weight X(n) , a 
leaky integrator is used, and the value of combining weight A{n) is calculated as 

/lW = (l-p,).;i(/i-l) + /7,.I(,i) 

where is the leakage term and is chosen less than or equal to one and greater than or 
equal to zero. Also, the combining weight A{n) may be compared to two thresholds, 
and . If A(n) > , then X(n) is set to a register value, for example, unity; if 
Z{n) < , then X(n) is set to another register value, for example, zero. 
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A circuit used to calculate the combining weight in accordance with certain aspects 
of the present invention is shown in Figure 11. The baseband LMS error term is 
calculated and supplied to norm calculations 1105 and 1110, which calculate norms 

^2 =-^e^ +4 and ^, =k/ l + ke I, respectively, where is the real part and Bq is the 

imaginary part of the baseband LMS error term. Mux 1115 selects the desired norm 
calculation, and multiplier 1120 normalizes the result so that the maximum value of the 
product (neglecting constellation points with open-ended decision regions) is unity, 
according to the norm selected. Block 1 125 clips the result to unity, accounting for those 
constellation points with open-ended decision regions. 

Multiplier 1 130, adder 1 135, delay 1 140, and multiplier 1 145 comprise a leaky integrator, 
or averaging circuit. Multiplier 1130 scales the clipped value from block 1125 by the 
leakage amount . This product is summed with the output of multiplier 1 145 in adder 
1 135. The result is delayed in delay element 1 140, scaled by the leakage amount \- P;^ 
in multiplier 1 145 and then supplied to adder 1135. 

The output of adder 1 135 is first compared to upper threshold in comparator 11 55. If 
this first comparison is satisfied, the combining weight is set to programmable upper 
value Ty - value in assignment block 1165. If this first comparison is not satisfied, the 
output of adder 1135 is next compared to lower threshold . If this second comparison 
is satisfied, the combining weight is set to programmable lower value - value in 
assignment block 1170. If this second comparison is not satisfied, the output of adder 
1 135 remains unchanged in assignment block 1 175. 

An alternative embodiment of the present invention calculates a combining weight X{n) 
by indexing a programmable array of stored combining weight values, indexed by array 
index i provided by Array Index Update block 975. 
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Referring again to Figure 9, the Error Term and Combining Weight Calculator 905 
supplies candidate error terms to mux 910, and the baseband LMS error term is split into 
in-phase (I) and quadrature-phase (Q) components and supplied to absolute value blocks 
930 and 925, respectively. The absolute values of I and Q components are compared to 
threshold 7^ in comparison blocks 935 and 940, respectively. Comparison blocks 935 
and 940 output a single bit each, designating true or false comparisons. Furthermore, the 
threshold used in comparison blocks 935 and 940 is adjusted over time and supplied 
by Array Block 980. The output bits of comparison blocks 935 and 940 are logically 
compared in AND gate 945, generating control signal in_box. This control signal is 
therefore an instantaneous indicator of the proximity of the soft decision sample to the 
hard decision sample, conditioned upon the current value of threshold . 

A second control signal used in the Signal Processing, Logic, and Control Block 260 is 
generated based on the position of the hard decision sample in the source constellation, 
conditioned upon a prescribed template. For example, Slicer 270 supplies the symbol 
number (O..Jl^for M-ary QAM) prescribed to the hard decision sample in the 
constellation bit-to-symbol mapping (usually a type of grey code) to Region LUT 920. 
Region LUT 920 stores the desired template that is used to discern the second control 
signal. For example, Figure 12 shows two such templates, and associate a bit with each 
constellation point. The left-hand plot encompasses an approximate annulus, while the 
right-hand plot encompasses the 16 most inner constellation points. Each template is 
stored in a single column of Region LUT 920, with row index determined by the symbol 
number 0..J/. One preferred embodiment of the present invention uses 16 such 
templates, so that the size of Region LUT is Mxl6, and the array index i ranges from 
0.../J. The symbol number of the bit-to-symbol mapping provided by Slicer 270 
addresses the Region LUT 920 and selects a single row of the LUT contents, providing a 
16-bit resuh. This 16-bit result is addressed by the array index i in mux 985, producing 
a second control signal referred to as in^region. 
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A third control signal used in the Signal Processing, Logic, and Control Block 260 is a 
training sequence indicator and is denoted by TR. This control signal is valid when the 
current sample is part of a training sequence sent by the transmitter, instead of bearing 
user data. It is apparent to one skilled in the art how to derive a training sequence 
indicator, for example using correlation techniques. 

Control signals in_box and TR are logically compared in AND gate 950, and the result is 
supplied to an accumulator 970 that increments when the result is true. The accumulator 
970 operates until a clear signal is provided by Array Index Update block 975, at which 
time accumulator 970 supplies its accumulated count, designated as , to Array Index 
Update block 975, and resets its accumulator value to zero. 

Control signals in_box and in_region are logically compared in AND gate 955, and the 
result is supplied to AND gate 960, which logically compares this result with the 
complement of control signal TR. The result of AND gate 960 is supplied to accumulator 
965 that increments when the result is true. The accumulator 965 operates until a clear 
signal is provided by Array Index Update block 975, at which time accumulator 965 
supplies its accumulated count, designated as J^, to Array Index Update block 975, and 
resets its accumulator value to zero. 

Array Index Update block 975 calculates array index ^ ranging from 0.../5 in the 
preferred embodiment of the present invention, according to the flow diagram in Figure 
13. This circuit is executed at an integer multiple of the baud period, depending on the 
desired window length. For example, for systems operating in dynamic multipath 
conditions that are fast relative to the baud rate, this circuit may be executed every 64 or 
128 baud iterations; but for systems operating in slowly changing signaling conditions 
relative to the baud rate, this circuit may be executed only every l}^ or 2^^ baud iterations. 
Furthermore, the execution rate may be changed over the course of demodulator 
operation, for example, smaller at the start of demodulator operation to achieve rapid 
convergence, then increased once the array index has crossed a prescribed threshold. 
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The circuit in Figure 13 uses two arrays of length P (16 in a preferred embodiment of the 
present invention), denoted by Aj and , and one threshold, denoted by H. The values 
and J J, from accumulators 965 and 970, respectively, and the current value of array 
index £ (usually initialized to zero at demodulator power up) are also used in this circuit. 
The left hand side of the circuit is used to increment array index £ , while the right hand 
side is used to decrement array index Decision block 1305 checks to see that the 
current value of array index £ does not exceed the maximum value, P-L If the current 
value of array index £ is less than P-7, then decision block 1310 is entered; else, the 
array index £ cannot be incremented and decision block 1320 is entered. Decision block 
1310 compares from accumulator 970 to a predetermined threshold H . If J^>H 
then decision block 1315 is entered; else, decision block 1320 is entered. Decision block 
1315 compares from accumulator 965 to the value of increment array A^ that is 
indexed by the current value of array index £ , denoted by A^ {£) . \i J^> A, (£) then 
array index £ is incremented in increment block 1330 and output block 1340 is entered; 
else, decision block 1320 is entered. 

Decision block 1320 checks to see that the current value of array index £ does not 
exceed the minimum value, 0. If ^ > 0 is satisfied, then decision block 1325 is entered; 
else, output block 1340 is entered. Decision block 1325 compares from accumulator 
970 to the value of decrement array A^ that is indexed by the current value of array 
index £, denoted by y4^(^). If JD<ADiO then array index £ is decremented in 
decrement block 1335 and output block 1340 is entered; else, output block 1340 is 
entered directly. 

Output block 1340 sends "clear" signals to accumulators 965 and 970, resetting the count 
to zero in each, and loads another value of array index £, which has either been 
incremented, decremented, or is unchanged. 

Referring again to the Signal Processing, Logic, and Control Block 260 in Figure 9, the 
array index £ from Array Index Update block 975 is provided to Array Block 980. Array 
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Block 980 stores a length-P (16 in a preferred embodiment of the present invention) array 
of threshold values . The array index I indexes this array and selects a new threshold 
, which is provided to comparison blocks 935 and 940. Similarly, Array Block 980 
stores a length-P array of stepsize values for the preferred embodiment of Feedback AGC 
240, which is indexed by array index ^ . A new value of stepsize fi^ is selected by array 
index i and provided to Feedback AGC 240. 

For the alternative embodiment of Feedback AGC 240, Array Block 980 stores a length- 
P array of freeze bits. The freeze bit selected by array index i is provided to logical OR 
gate 990. 

Moreover, Array Block 980 stores a length-P array of godard radius values, y , one of 
which is used for CMA error term calculation. The elements stored in the array of 
godard radius values can be pre-calculated in accordance with the templates that are 
stored in Region LUT 920 corresponding to the current value of array index t . For 
example, since arbitrary templates of constellation points can be designed for inclusion in 
Region LUT 920, the CMA error term should be calculated based on the effective source 
constellation it sees (a subset of the full QAM source constellation), which could be the 
values stored in the present template of Region LUT 920, or their complement. This 
choice depends on the design of the Adaptation Select LUT 915. The godard radius 
selected by array index i is provided to Error Term and Combining Weight Calculator 
905, and used in the CMA error term calculation. 

Array Block 980 can also store arrays for other demodulator parameters, like stepsizes for 
adaptive filters 220 and 230. Each array is indexed by array index ^, and a new 
parameter is provided by Array Block 980 for demodulator operation. 

The three control signals, in_box, in region, and TR are used also as address bits to 
Adaptation Select LUT 915. Adaptation Select LUT 915 stores a programmable LUT 
that contains various control bits for demodulator operation, whose selected values are 
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contingent upon the current values of control signals in_box, in region, and TR. For 
example, Adaptation Select LUT 915 outputs two control bits that govern mux 910, 
which selects the error term used in update of Equalizer 175 coefficients, choosing from 
among LMS, CMA, combined LMS and CMA (using combining weight A{n) ), and zero, 
at each baud instance. An example adaptation strategy may be to use an LMS error term, 
derived from training data, when TR is true; CMA when in^region is true and in_box is 
not; a combination error term when both in_region and in_box are true, and a zero- valued 
error term otherwise. The Adaptation Select LUT 915 can be programmed to 
acconmiodate a variety of adaptation strategies, not limited to this example. 
Furthermore, the Adaptation Select LUT 915 can be reloaded over the course of 
demodulator operation, for example, contingent on the value of array index £, an 
estimate of demodulated SNR, or after a prescribed amount of time, so that the adaptation 
strategy is itself changed over the course of demodulator operation. 

The Adaptation Select LUT 915 contains control bits for other demodulator operations, 
too. For example, referring again to Figure 5, three control signals are shown that govem 
Carrier Loop 250 operation. Control signal 1 governs mux 510 that selects a hard 
decision sample from Sheer 270 or Coarse Tune Hard Decision Block 505. One strategy 
selects the hard decision sample from Slicer 270 when all of TR, in^region, and in_box 
are true, or when only in_region and in_box are true, and selects a hard decision sample 
from coarse tune Hard Decision Block 505 otherwise. Control signal 2 is used to govem 
mux 520, and may select a zero-valued phase error when TR, in region, and in box are 
false, indicating low quality of demodulator performance, instead of selecting the phase 
error generated by Phase Detector 515. Control Signal 3 is used to select the style of hard 
decision generated from Coarse Tune Hard Decision Block 505, selecting from among 
the five operating modes of Carrier Loop 250 previously described. 

Adaptation Select LUT 915 also contains control bits for Feedback AGC 177 operation. 
For the preferred embodiment of Feedback AGC 177, four operating modes have been 
identified, and can be selected on a symbol-by-symbol basis using the control bits for 
from Adaptation Select LUT 915. The association of operating mode to control signals 
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in_box, in^region, and TR depends upon the templates stored in Region LUT 920. For 
example, if Region LUT 920 stores an annulus (as described in the left subplot of Figure 
12), then a CM-like cost fiinction may be chosen when in_region is true; however, if 
Region LUT 920 stores inner constellation points (as described in the right subplot of 
Figure 12), then an LMS-like cost function may be chosen when in_region is true. 

If the altemative embodiment of Feedback AGC 177, illustrated in Figure 4, is used, then 
control bits from Adaptation Select LUT 915 determine the operating modes previously 
described for Figure 4. One of the control bits from Adaptation Select LUT 915 is 
provided to logical OR gate 990, together with a control bit from Array Block 980. The 
result from logical OR gate 990 is used to select the operating mode that determines AGC 
freeze. 

Adaptation Select LUT 915 also contains control bits for Timing Recovery Loop 170, 
that determine the operating mode from among the six operating modes previously 
discussed in Figure 7. Moreover, control bits are provided to Loop Filter 730 from 
Adaptation Select LUT 915 that govern loop constants for proportional and integral 
control. For example, one of these control bits if true selects loop constants from 
registered values set by the user or application software, or if false according to a 
predetermined table. One such table is below: 



Control Bits to Loop Filter 


Proportional Loop Constant 


Integral Loop Constant 


0 0 


Registered value 


Registered value 


0 1 


Registered value/8 


Registered value/8 


1 0 


Registered value/8 


Registered vaIue/16 


1 1 


0 


0 



The methods described in Figure 9 for the Signal Processing, Logic, and Control Block 
260 facilitate extremely flexible demodulator operation, since LUTs are programmable, 
and can be selected for a desired operational environment of the demodulator. For 
example, the templates stored in Region LUT 920 can be designed so that outer-most 
signal points are handled differently, to account for nonlinear effects caused by high 
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power amplifiers in the transmitter. Adaptation strategies for physical layer functions can 
be completely modified by simple LUT change, and if re-loaded while the demodulator is 
operating, adaptation strategies can be changed "on-the-fly". Furthermore, the coimters 
and telemetry points in the Signal Processing, Logic and Control Block 260 (Figure 9) 
can be useful indicators of demodulator health, and provide signal quality assessment to 
the user, aiding in antenna placement or other tasks needing receiver feedback. 

One skilled in the art would understand that the equations described herein may include 
scaling, change of sign, or similar constant modifications that are not shown for 
simplicity. One skilled in the art would realize that such modifications can be readily 
determined or derived for the particular implementation. Thus, the described equations 
may be subject to such modifications, and are not limited to the exact forms presented 
herein. 

Certain aspects and embodiments of the present invention have been described using 
Quadrature Amplitude Modulation (QAM) signals with complex signal processing, 
unless specifically noted. However, one skilled in the art would realize that the 
techniques described herein may be applied to a receiver processing Phase-Shift Keyed 
(PSK), Pulse Amplitude Modulation (PAM) , or other signals. 

As would be apparent to one skilled in the art, the various functions of equalization, 
signal combining, automatic gain control, carrier recovery, and timing recovery may be 
implemented with circuit elements or may also be implemented in the digital domain as 
processing steps in a software program. Such software may be employed in, for example, 
a digital signal processor, micro-controller, or general-purpose computer. 

The present invention can be embodied in the form of methods and apparatuses for 
practicing those methods. The present invention can also be embodied in the form of 
program code embodied in tangible media, such as floppy diskettes, CD-ROMs, hard 
drives, or any other machine-readable storage medium, wherein, when the program code 
is loaded into and executed by a machine, such as a computer, the machine becomes an 
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apparatus for practicing the invention. The present invention can also be embodied in the 
form of program code, for example, whether stored in a storage medium, loaded into 
and/or executed by a machine, or transmitted over some transmission medium, such as 
over electrical wiring or cabling, through fiber optics, or via electromagnetic radiation, 
wherein, when the program code is loaded into and executed by a machine, such as a 
computer, the machine becomes an apparatus for practicing the invention. When 
implemented on a general-purpose processor, the program code segments combine with 
the processor to provide a unique device that operates analogously to specific logic 
circuits. 

It will be further understood that various changes in the details, materials, and 
arrangements of the parts which have been described and illustrated in order to explain 
the nature of this invention may be made by those skilled in the art without departing 
from the principle and scope of the invention as expressed in the following claim. 
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